Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 462 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 36.2 KiB |
| Average record size in memory | 80.3 B |
Variable types
| NUM | 8 |
|---|---|
| BOOL | 2 |
Reproduction
| Analysis started | 2020-08-25 01:48:27.925074 |
|---|---|
| Analysis finished | 2020-08-25 01:48:38.117122 |
| Duration | 10.19 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Sbp
Real number (ℝ≥0)
| Distinct count | 62 |
|---|---|
| Unique (%) | 13.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 138.32683982683983 |
|---|---|
| Minimum | 101 |
| Maximum | 218 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 112 |
| Q1 | 124 |
| median | 134 |
| Q3 | 148 |
| 95-th percentile | 176 |
| Maximum | 218 |
| Range | 117 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 20.49631718 |
|---|---|
| Coefficient of variation (CV) | 0.1481731037 |
| Kurtosis | 1.781646545 |
| Mean | 138.3268398 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.180590625 |
| Sum | 63907 |
| Variance | 420.0990178 |
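Several of the figures reported for Sbp can be cross-checked against each other. The sketch below is only a sanity check, not part of the profiling tool: it copies the reported values by hand and assumes the usual definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum). The variable names are illustrative.

```python
# Values copied from the Sbp tables in this report.
n_observations = 462
reported_sum = 63907
reported_std = 20.49631718
q1, q3 = 124, 148
minimum, maximum = 101, 218

# Derived quantities, using the standard textbook definitions (an assumption
# about how the report computes them, though the numbers agree).
mean = reported_sum / n_observations
variance = reported_std ** 2
coefficient_of_variation = reported_std / mean
interquartile_range = q3 - q1
value_range = maximum - minimum

print(f"mean     = {mean:.4f}")
print(f"variance = {variance:.4f}")
print(f"CV       = {coefficient_of_variation:.4f}")
print(f"IQR      = {interquartile_range}")
print(f"range    = {value_range}")
```

The derived values agree with the Mean, Variance, CV, IQR and Range rows above up to the rounding of the reported standard deviation.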
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 134 | 29 | 6.3% | |
| 136 | 29 | 6.3% | |
| 128 | 25 | 5.4% | |
| 132 | 24 | 5.2% | |
| 124 | 21 | 4.5% | |
| 118 | 21 | 4.5% | |
| 126 | 20 | 4.3% | |
| 130 | 20 | 4.3% | |
| 138 | 18 | 3.9% | |
| 122 | 17 | 3.7% | |
| 120 | 14 | 3.0% | |
| 142 | 14 | 3.0% | |
| 148 | 12 | 2.6% | |
| 140 | 12 | 2.6% | |
| 114 | 12 | 2.6% | |
| 146 | 11 | 2.4% | |
| 144 | 10 | 2.2% | |
| 154 | 10 | 2.2% | |
| 162 | 9 | 1.9% | |
| 160 | 9 | 1.9% | |
| 152 | 8 | 1.7% | |
| 116 | 8 | 1.7% | |
| 166 | 8 | 1.7% | |
| 108 | 7 | 1.5% | |
| 112 | 7 | 1.5% | |
| Other values (37) | 87 | 18.8% |
| Value | Count | Frequency (%) | |
| 101 | 1 | 0.2% | |
| 102 | 1 | 0.2% | |
| 103 | 1 | 0.2% | |
| 106 | 3 | 0.6% | |
| 108 | 7 | 1.5% | |
| 109 | 1 | 0.2% | |
| 110 | 4 | 0.9% | |
| 112 | 7 | 1.5% | |
| 114 | 12 | 2.6% | |
| 116 | 8 | 1.7% |
| Value | Count | Frequency (%) | |
| 218 | 1 | 0.2% | |
| 216 | 1 | 0.2% | |
| 214 | 1 | 0.2% | |
| 208 | 3 | 0.6% | |
| 206 | 2 | 0.4% | |
| 200 | 1 | 0.2% | |
| 198 | 1 | 0.2% | |
| 194 | 2 | 0.4% | |
| 190 | 2 | 0.4% | |
| 188 | 1 | 0.2% |
Tobacco
Real number (ℝ≥0)
| Distinct count | 214 |
|---|---|
| Unique (%) | 46.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.635649350649351 |
|---|---|
| Minimum | 0.0 |
| Maximum | 31.2 |
| Zeros | 107 |
| Zeros (%) | 23.2% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.0525 |
| median | 2 |
| Q3 | 5.5 |
| 95-th percentile | 12.49 |
| Maximum | 31.2 |
| Range | 31.2 |
| Interquartile range (IQR) | 5.4475 |
Descriptive statistics
| Standard deviation | 4.593024078 |
|---|---|
| Coefficient of variation (CV) | 1.263329776 |
| Kurtosis | 5.968107866 |
| Mean | 3.635649351 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.079209667 |
| Sum | 1679.67 |
| Variance | 21.09587018 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 107 | 23.2% | |
| 6 | 11 | 2.4% | |
| 3 | 10 | 2.2% | |
| 0.4 | 8 | 1.7% | |
| 4 | 8 | 1.7% | |
| 4.2 | 7 | 1.5% | |
| 4.5 | 7 | 1.5% | |
| 1.2 | 5 | 1.1% | |
| 0.6 | 5 | 1.1% | |
| 2 | 5 | 1.1% | |
| 1.5 | 5 | 1.1% | |
| 12 | 5 | 1.1% | |
| 8.8 | 4 | 0.9% | |
| 0.12 | 4 | 0.9% | |
| 5.6 | 4 | 0.9% | |
| 7.5 | 4 | 0.9% | |
| 5.5 | 4 | 0.9% | |
| 0.05 | 4 | 0.9% | |
| 2.6 | 3 | 0.6% | |
| 1.8 | 3 | 0.6% | |
| 0.8 | 3 | 0.6% | |
| 2.8 | 3 | 0.6% | |
| 0.5 | 3 | 0.6% | |
| 0.28 | 3 | 0.6% | |
| 10.5 | 3 | 0.6% | |
| Other values (189) | 234 | 50.6% |
| Value | Count | Frequency (%) | |
| 0 | 107 | 23.2% | |
| 0.01 | 1 | 0.2% | |
| 0.02 | 1 | 0.2% | |
| 0.03 | 1 | 0.2% | |
| 0.04 | 2 | 0.4% | |
| 0.05 | 4 | 0.9% | |
| 0.06 | 1 | 0.2% | |
| 0.07 | 1 | 0.2% | |
| 0.08 | 2 | 0.4% | |
| 0.09 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 31.2 | 1 | 0.2% | |
| 27.4 | 1 | 0.2% | |
| 25.01 | 1 | 0.2% | |
| 20 | 2 | 0.4% | |
| 19.6 | 1 | 0.2% | |
| 19.45 | 1 | 0.2% | |
| 19.2 | 1 | 0.2% | |
| 18.2 | 1 | 0.2% | |
| 18 | 1 | 0.2% | |
| 16 | 1 | 0.2% |
Ldl
Real number (ℝ≥0)
| Distinct count | 329 |
|---|---|
| Unique (%) | 71.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.740324675324675 |
|---|---|
| Minimum | 0.98 |
| Maximum | 15.33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 0.98 |
|---|---|
| 5-th percentile | 2.1945 |
| Q1 | 3.2825 |
| median | 4.34 |
| Q3 | 5.79 |
| 95-th percentile | 8.404 |
| Maximum | 15.33 |
| Range | 14.35 |
| Interquartile range (IQR) | 2.5075 |
Descriptive statistics
| Standard deviation | 2.070909161 |
|---|---|
| Coefficient of variation (CV) | 0.4368707426 |
| Kurtosis | 2.876552943 |
| Mean | 4.740324675 |
| Median Absolute Deviation (MAD) | 1.195 |
| Skewness | 1.31310398 |
| Sum | 2190.03 |
| Variance | 4.288664753 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 3.95 | 5 | 1.1% | |
| 4.37 | 5 | 1.1% | |
| 3.57 | 5 | 1.1% | |
| 2.4 | 4 | 0.9% | |
| 3.58 | 4 | 0.9% | |
| 3.3 | 4 | 0.9% | |
| 4.16 | 4 | 0.9% | |
| 3.14 | 3 | 0.6% | |
| 1.88 | 3 | 0.6% | |
| 4.9 | 3 | 0.6% | |
| 3.17 | 3 | 0.6% | |
| 5.9 | 3 | 0.6% | |
| 4.19 | 3 | 0.6% | |
| 3.79 | 3 | 0.6% | |
| 6.06 | 3 | 0.6% | |
| 2.42 | 3 | 0.6% | |
| 3.69 | 3 | 0.6% | |
| 5.63 | 3 | 0.6% | |
| 2.28 | 3 | 0.6% | |
| 4.89 | 3 | 0.6% | |
| 4.55 | 3 | 0.6% | |
| 3.12 | 3 | 0.6% | |
| 3.98 | 3 | 0.6% | |
| 2.44 | 3 | 0.6% | |
| 4.75 | 3 | 0.6% | |
| Other values (304) | 377 | 81.6% |
| Value | Count | Frequency (%) | |
| 0.98 | 1 | 0.2% | |
| 1.07 | 1 | 0.2% | |
| 1.43 | 1 | 0.2% | |
| 1.55 | 1 | 0.2% | |
| 1.59 | 1 | 0.2% | |
| 1.71 | 1 | 0.2% | |
| 1.72 | 1 | 0.2% | |
| 1.74 | 1 | 0.2% | |
| 1.77 | 1 | 0.2% | |
| 1.8 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 15.33 | 1 | 0.2% | |
| 14.16 | 1 | 0.2% | |
| 12.42 | 1 | 0.2% | |
| 11.89 | 1 | 0.2% | |
| 11.61 | 1 | 0.2% | |
| 11.41 | 1 | 0.2% | |
| 11.32 | 1 | 0.2% | |
| 11.17 | 1 | 0.2% | |
| 10.58 | 1 | 0.2% | |
| 10.53 | 1 | 0.2% |
Adiposity
Real number (ℝ≥0)
| Distinct count | 408 |
|---|---|
| Unique (%) | 88.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.4067316017316 |
|---|---|
| Minimum | 6.74 |
| Maximum | 42.49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 6.74 |
|---|---|
| 5-th percentile | 12.0065 |
| Q1 | 19.775 |
| median | 26.115 |
| Q3 | 31.2275 |
| 95-th percentile | 37.1165 |
| Maximum | 42.49 |
| Range | 35.75 |
| Interquartile range (IQR) | 11.4525 |
Descriptive statistics
| Standard deviation | 7.780698596 |
|---|---|
| Coefficient of variation (CV) | 0.306245554 |
| Kurtosis | -0.6984386244 |
| Mean | 25.4067316 |
| Median Absolute Deviation (MAD) | 5.7 |
| Skewness | -0.2146459286 |
| Sum | 11737.91 |
| Variance | 60.53927064 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 30.79 | 3 | 0.6% | |
| 29.3 | 3 | 0.6% | |
| 21.1 | 3 | 0.6% | |
| 27.55 | 3 | 0.6% | |
| 30.84 | 2 | 0.4% | |
| 9.69 | 2 | 0.4% | |
| 27.68 | 2 | 0.4% | |
| 23.52 | 2 | 0.4% | |
| 28.11 | 2 | 0.4% | |
| 20.47 | 2 | 0.4% | |
| 37.83 | 2 | 0.4% | |
| 31.29 | 2 | 0.4% | |
| 34.46 | 2 | 0.4% | |
| 15.89 | 2 | 0.4% | |
| 18.96 | 2 | 0.4% | |
| 23.07 | 2 | 0.4% | |
| 25.73 | 2 | 0.4% | |
| 17.33 | 2 | 0.4% | |
| 24.65 | 2 | 0.4% | |
| 32.03 | 2 | 0.4% | |
| 24.83 | 2 | 0.4% | |
| 29.18 | 2 | 0.4% | |
| 16.38 | 2 | 0.4% | |
| 12.13 | 2 | 0.4% | |
| 23.88 | 2 | 0.4% | |
| Other values (383) | 408 | 88.3% |
| Value | Count | Frequency (%) | |
| 6.74 | 1 | 0.2% | |
| 7.12 | 1 | 0.2% | |
| 8.66 | 1 | 0.2% | |
| 9.28 | 1 | 0.2% | |
| 9.37 | 1 | 0.2% | |
| 9.39 | 1 | 0.2% | |
| 9.64 | 1 | 0.2% | |
| 9.69 | 2 | 0.4% | |
| 9.74 | 1 | 0.2% | |
| 10.05 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 42.49 | 1 | 0.2% | |
| 42.17 | 1 | 0.2% | |
| 42.06 | 1 | 0.2% | |
| 41.05 | 1 | 0.2% | |
| 40.6 | 1 | 0.2% | |
| 39.97 | 1 | 0.2% | |
| 39.71 | 1 | 0.2% | |
| 39.68 | 1 | 0.2% | |
| 39.66 | 1 | 0.2% | |
| 39.64 | 1 | 0.2% |
Famhist
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.7 KiB |
| Value | Count | Frequency (%) | |
| 0 | 270 | 58.4% | |
| 1 | 192 | 41.6% |
Typea
Real number (ℝ≥0)
| Distinct count | 54 |
|---|---|
| Unique (%) | 11.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.103896103896105 |
|---|---|
| Minimum | 13 |
| Maximum | 78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 47 |
| median | 53 |
| Q3 | 60 |
| 95-th percentile | 69 |
| Maximum | 78 |
| Range | 65 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.817534116 |
|---|---|
| Coefficient of variation (CV) | 0.1848740834 |
| Kurtosis | 0.4704023399 |
| Mean | 53.1038961 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.3464377547 |
| Sum | 24534 |
| Variance | 96.38397611 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 52 | 25 | 5.4% | |
| 57 | 23 | 5.0% | |
| 50 | 21 | 4.5% | |
| 54 | 21 | 4.5% | |
| 49 | 20 | 4.3% | |
| 56 | 18 | 3.9% | |
| 60 | 18 | 3.9% | |
| 61 | 17 | 3.7% | |
| 47 | 17 | 3.7% | |
| 55 | 17 | 3.7% | |
| 45 | 16 | 3.5% | |
| 46 | 16 | 3.5% | |
| 51 | 15 | 3.2% | |
| 48 | 14 | 3.0% | |
| 58 | 14 | 3.0% | |
| 53 | 14 | 3.0% | |
| 42 | 13 | 2.8% | |
| 59 | 13 | 2.8% | |
| 63 | 11 | 2.4% | |
| 64 | 11 | 2.4% | |
| 65 | 11 | 2.4% | |
| 62 | 9 | 1.9% | |
| 66 | 9 | 1.9% | |
| 41 | 9 | 1.9% | |
| 69 | 7 | 1.5% | |
| Other values (29) | 83 | 18.0% |
| Value | Count | Frequency (%) | |
| 13 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| 25 | 1 | 0.2% | |
| 26 | 1 | 0.2% | |
| 28 | 1 | 0.2% | |
| 29 | 1 | 0.2% | |
| 30 | 2 | 0.4% | |
| 31 | 2 | 0.4% | |
| 32 | 1 | 0.2% | |
| 33 | 4 | 0.9% |
| Value | Count | Frequency (%) | |
| 78 | 1 | 0.2% | |
| 77 | 1 | 0.2% | |
| 75 | 1 | 0.2% | |
| 74 | 2 | 0.4% | |
| 73 | 2 | 0.4% | |
| 72 | 4 | 0.9% | |
| 71 | 2 | 0.4% | |
| 70 | 5 | 1.1% | |
| 69 | 7 | 1.5% | |
| 68 | 6 | 1.3% |
Obesity
Real number (ℝ≥0)
| Distinct count | 400 |
|---|---|
| Unique (%) | 86.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.04411255411255 |
|---|---|
| Minimum | 14.7 |
| Maximum | 46.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 14.7 |
|---|---|
| 5-th percentile | 20.17 |
| Q1 | 22.985 |
| median | 25.805 |
| Q3 | 28.4975 |
| 95-th percentile | 33.138 |
| Maximum | 46.58 |
| Range | 31.88 |
| Interquartile range (IQR) | 5.5125 |
Descriptive statistics
| Standard deviation | 4.213680227 |
|---|---|
| Coefficient of variation (CV) | 0.161790125 |
| Kurtosis | 2.255971618 |
| Mean | 26.04411255 |
| Median Absolute Deviation (MAD) | 2.71 |
| Skewness | 0.9052194041 |
| Sum | 12032.38 |
| Variance | 17.75510105 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 24.86 | 4 | 0.9% | |
| 26.09 | 4 | 0.9% | |
| 22.01 | 3 | 0.6% | |
| 27.29 | 3 | 0.6% | |
| 21.94 | 3 | 0.6% | |
| 28.4 | 3 | 0.6% | |
| 24.7 | 3 | 0.6% | |
| 22.59 | 3 | 0.6% | |
| 24.98 | 3 | 0.6% | |
| 25.99 | 3 | 0.6% | |
| 22.51 | 3 | 0.6% | |
| 31.44 | 2 | 0.4% | |
| 30.01 | 2 | 0.4% | |
| 23.63 | 2 | 0.4% | |
| 28.63 | 2 | 0.4% | |
| 25.63 | 2 | 0.4% | |
| 30.31 | 2 | 0.4% | |
| 20.28 | 2 | 0.4% | |
| 27.36 | 2 | 0.4% | |
| 22.65 | 2 | 0.4% | |
| 24.49 | 2 | 0.4% | |
| 29.38 | 2 | 0.4% | |
| 28.07 | 2 | 0.4% | |
| 23.23 | 2 | 0.4% | |
| 23.37 | 2 | 0.4% | |
| Other values (375) | 399 | 86.4% |
| Value | Count | Frequency (%) | |
| 14.7 | 1 | 0.2% | |
| 17.75 | 1 | 0.2% | |
| 17.81 | 1 | 0.2% | |
| 17.89 | 1 | 0.2% | |
| 18.36 | 1 | 0.2% | |
| 18.46 | 1 | 0.2% | |
| 18.5 | 1 | 0.2% | |
| 18.75 | 1 | 0.2% | |
| 19.15 | 1 | 0.2% | |
| 19.3 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 46.58 | 1 | 0.2% | |
| 45.72 | 1 | 0.2% | |
| 41.76 | 1 | 0.2% | |
| 40.34 | 1 | 0.2% | |
| 38.8 | 1 | 0.2% | |
| 37.71 | 1 | 0.2% | |
| 37.41 | 1 | 0.2% | |
| 37.24 | 1 | 0.2% | |
| 36.46 | 1 | 0.2% | |
| 36.06 | 1 | 0.2% |
Alcohol
Real number (ℝ≥0)
| Distinct count | 249 |
|---|---|
| Unique (%) | 53.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.044393939393935 |
|---|---|
| Minimum | 0.0 |
| Maximum | 147.19 |
| Zeros | 110 |
| Zeros (%) | 23.8% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.51 |
| median | 7.51 |
| Q3 | 23.8925 |
| 95-th percentile | 66.8495 |
| Maximum | 147.19 |
| Range | 147.19 |
| Interquartile range (IQR) | 23.3825 |
Descriptive statistics
| Standard deviation | 24.48105869 |
|---|---|
| Coefficient of variation (CV) | 1.43631148 |
| Kurtosis | 6.421109969 |
| Mean | 17.04439394 |
| Median Absolute Deviation (MAD) | 7.51 |
| Skewness | 2.312698937 |
| Sum | 7874.51 |
| Variance | 599.3222347 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 110 | 23.8% | |
| 2.06 | 16 | 3.5% | |
| 0.51 | 8 | 1.7% | |
| 11.11 | 5 | 1.1% | |
| 43.2 | 5 | 1.1% | |
| 8.23 | 5 | 1.1% | |
| 14.4 | 5 | 1.1% | |
| 8.33 | 5 | 1.1% | |
| 4.11 | 4 | 0.9% | |
| 3.81 | 4 | 0.9% | |
| 1.03 | 4 | 0.9% | |
| 12.86 | 4 | 0.9% | |
| 6.17 | 3 | 0.6% | |
| 23.66 | 3 | 0.6% | |
| 2.78 | 3 | 0.6% | |
| 2.49 | 3 | 0.6% | |
| 11.83 | 3 | 0.6% | |
| 10.49 | 3 | 0.6% | |
| 13.37 | 3 | 0.6% | |
| 4.63 | 3 | 0.6% | |
| 21.6 | 3 | 0.6% | |
| 18.51 | 3 | 0.6% | |
| 18.72 | 2 | 0.4% | |
| 3.6 | 2 | 0.4% | |
| 28.8 | 2 | 0.4% | |
| Other values (224) | 251 | 54.3% |
| Value | Count | Frequency (%) | |
| 0 | 110 | 23.8% | |
| 0.19 | 1 | 0.2% | |
| 0.26 | 1 | 0.2% | |
| 0.37 | 2 | 0.4% | |
| 0.51 | 8 | 1.7% | |
| 0.6 | 1 | 0.2% | |
| 0.68 | 2 | 0.4% | |
| 0.69 | 1 | 0.2% | |
| 0.74 | 2 | 0.4% | |
| 0.86 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 147.19 | 1 | 0.2% | |
| 145.29 | 1 | 0.2% | |
| 144 | 1 | 0.2% | |
| 120.03 | 1 | 0.2% | |
| 109.8 | 1 | 0.2% | |
| 108 | 1 | 0.2% | |
| 100.32 | 1 | 0.2% | |
| 97.2 | 1 | 0.2% | |
| 92.62 | 1 | 0.2% | |
| 90.93 | 1 | 0.2% |
Age
Real number (ℝ≥0)
| Distinct count | 49 |
|---|---|
| Unique (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.816017316017316 |
|---|---|
| Minimum | 15 |
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.7 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 31 |
| median | 45 |
| Q3 | 55 |
| 95-th percentile | 62 |
| Maximum | 64 |
| Range | 49 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.60895644 |
|---|---|
| Coefficient of variation (CV) | 0.3412030675 |
| Kurtosis | -1.01622901 |
| Mean | 42.81601732 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.3817342585 |
| Sum | 19781 |
| Variance | 213.4216084 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 16 | 20 | 4.3% | |
| 58 | 17 | 3.7% | |
| 17 | 17 | 3.7% | |
| 61 | 16 | 3.5% | |
| 59 | 16 | 3.5% | |
| 55 | 16 | 3.5% | |
| 60 | 15 | 3.2% | |
| 49 | 14 | 3.0% | |
| 53 | 14 | 3.0% | |
| 45 | 14 | 3.0% | |
| 64 | 13 | 2.8% | |
| 38 | 13 | 2.8% | |
| 42 | 13 | 2.8% | |
| 48 | 13 | 2.8% | |
| 40 | 12 | 2.6% | |
| 62 | 12 | 2.6% | |
| 32 | 11 | 2.4% | |
| 46 | 11 | 2.4% | |
| 27 | 11 | 2.4% | |
| 52 | 10 | 2.2% | |
| 54 | 10 | 2.2% | |
| 41 | 10 | 2.2% | |
| 39 | 10 | 2.2% | |
| 33 | 9 | 1.9% | |
| 56 | 9 | 1.9% | |
| Other values (24) | 136 | 29.4% |
| Value | Count | Frequency (%) | |
| 15 | 3 | 0.6% | |
| 16 | 20 | 4.3% | |
| 17 | 17 | 3.7% | |
| 18 | 8 | 1.7% | |
| 19 | 2 | 0.4% | |
| 20 | 6 | 1.3% | |
| 21 | 3 | 0.6% | |
| 23 | 2 | 0.4% | |
| 24 | 6 | 1.3% | |
| 25 | 4 | 0.9% |
| Value | Count | Frequency (%) | |
| 64 | 13 | 2.8% | |
| 63 | 8 | 1.7% | |
| 62 | 12 | 2.6% | |
| 61 | 16 | 3.5% | |
| 60 | 15 | 3.2% | |
| 59 | 16 | 3.5% | |
| 58 | 17 | 3.7% | |
| 57 | 8 | 1.7% | |
| 56 | 9 | 1.9% | |
| 55 | 16 | 3.5% |
target
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.7 KiB |
| Value | Count | Frequency (%) | |
| 0 | 302 | 65.4% | |
| 1 | 160 | 34.6% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the slope does not affect r (only its sign does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
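The calculation described above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the code the profiling tool runs; the function name `pearson_r` is an assumption, and the sample values are the Sbp and Age columns from the first ten rows shown later in this report.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of products and squares are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / sqrt(ss_x * ss_y)


# Sbp and Age taken from the "First rows" table (rows 0 to 9).
sbp = [160, 144, 118, 170, 134, 132, 142, 114, 114, 132]
age = [52, 63, 46, 58, 49, 45, 38, 58, 29, 53]

r = pearson_r(sbp, age)
# Shifting and (positively) rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([2 * v + 10 for v in sbp], [0.5 * v - 3 for v in age])
print(r, r_rescaled)
```

Ten rows are far too few to say anything about the full dataset; the sample only demonstrates the arithmetic and the location/scale invariance.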
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
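As a rough sketch of that recipe, the example below ranks each variable (assigning tied values the average of their ranks, which is a common convention assumed here) and then applies Pearson's formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative, not part of any library.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson's r on two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: rho is 1, Pearson's r is below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

The cubic example shows why ρ is the better choice for monotonic, nonlinear associations: the ranks of `x` and `y` coincide exactly even though the raw values do not lie on a line.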
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
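The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows, with the function name `kendall_tau_a` chosen for illustration; note that statistical libraries often report a tie-corrected variant (tau-b) instead, so values may differ from this sketch when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables move in the same direction
        elif direction < 0:
            discordant += 1  # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither group
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


print(kendall_tau_a([1, 2, 3, 4], [10, 20, 30, 40]))  # fully concordant
print(kendall_tau_a([1, 2, 3, 4], [40, 30, 20, 10]))  # fully discordant
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))            # two concordant, one discordant
```

The double loop over pairs is quadratic in the number of observations, which is acceptable for a dataset of 462 rows but not how optimized implementations work.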
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | Sbp | Tobacco | Ldl | Adiposity | Famhist | Typea | Obesity | Alcohol | Age | target |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 160 | 12.00 | 5.73 | 23.11 | 1 | 49 | 25.30 | 97.20 | 52 | 1 |
| 1 | 144 | 0.01 | 4.41 | 28.61 | 0 | 55 | 28.87 | 2.06 | 63 | 1 |
| 2 | 118 | 0.08 | 3.48 | 32.28 | 1 | 52 | 29.14 | 3.81 | 46 | 0 |
| 3 | 170 | 7.50 | 6.41 | 38.03 | 1 | 51 | 31.99 | 24.26 | 58 | 1 |
| 4 | 134 | 13.60 | 3.50 | 27.78 | 1 | 60 | 25.99 | 57.34 | 49 | 1 |
| 5 | 132 | 6.20 | 6.47 | 36.21 | 1 | 62 | 30.77 | 14.14 | 45 | 0 |
| 6 | 142 | 4.05 | 3.38 | 16.20 | 0 | 59 | 20.81 | 2.62 | 38 | 0 |
| 7 | 114 | 4.08 | 4.59 | 14.60 | 1 | 62 | 23.11 | 6.72 | 58 | 1 |
| 8 | 114 | 0.00 | 3.83 | 19.40 | 1 | 49 | 24.86 | 2.49 | 29 | 0 |
| 9 | 132 | 0.00 | 5.80 | 30.96 | 1 | 69 | 30.11 | 0.00 | 53 | 1 |
Last rows
| | Sbp | Tobacco | Ldl | Adiposity | Famhist | Typea | Obesity | Alcohol | Age | target |
|---|---|---|---|---|---|---|---|---|---|---|
| 452 | 154 | 5.53 | 3.20 | 28.81 | 1 | 61 | 26.15 | 42.79 | 42 | 0 |
| 453 | 124 | 1.60 | 7.22 | 39.68 | 1 | 36 | 31.50 | 0.00 | 51 | 1 |
| 454 | 146 | 0.64 | 4.82 | 28.02 | 0 | 60 | 28.11 | 8.23 | 39 | 1 |
| 455 | 128 | 2.24 | 2.83 | 26.48 | 0 | 48 | 23.96 | 47.42 | 27 | 1 |
| 456 | 170 | 0.40 | 4.11 | 42.06 | 1 | 56 | 33.10 | 2.06 | 57 | 0 |
| 457 | 214 | 0.40 | 5.98 | 31.72 | 0 | 64 | 28.45 | 0.00 | 58 | 0 |
| 458 | 182 | 4.20 | 4.41 | 32.10 | 0 | 52 | 28.61 | 18.72 | 52 | 1 |
| 459 | 108 | 3.00 | 1.59 | 15.23 | 0 | 40 | 20.09 | 26.64 | 55 | 0 |
| 460 | 118 | 5.40 | 11.61 | 30.79 | 0 | 64 | 27.35 | 23.97 | 40 | 0 |
| 461 | 132 | 0.00 | 4.82 | 33.41 | 1 | 62 | 14.70 | 0.00 | 46 | 1 |